LLVM and SPIRV-LLVM-Translator pulldown (WW16 2024) #13484

iclsrc · 2024-04-18T18:34:22Z

LLVM: llvm/llvm-project@af7a82f
SPIRV-LLVM-Translator: KhronosGroup/SPIRV-LLVM-Translator@e1f7ebee5ed8b90

Now readNext defaults to unaligned accesses. This patch drops unaligned to improve readability.

…arge (#88172) Following #78348, we should treat functions with an explicit section as small, unless the section name is (or has the prefix) ".ltext". Clang emits global initializers into a ".text.startup" section on Linux. If we mix small/medium code model object files with large code model object files, we'll end up mixing sections with and without the large section flag. Reland of #87838 with a check for non-ELF platforms in TargetMachine::isLargeGlobalValue(), otherwise MCJIT on Windows tests fail.

The test shows that name preservation doesn't work properly when --include-generated-funcs is used.

This test is meant to demonstrate an upcoming change that replaces basic block labels by FileCheck patterns.

Add the fir.cuda_deallocate operation that perform device deallocation of data hold by a descriptor. This will replace the call to AllocatableDeallocate from the runtime. This is a companion operation to the one added in #88586

CONFLICT (content): Merge conflict in clang/lib/CodeGen/CGExprCXX.cpp

VPBlendRecipes don't get lowered to phis and usually do not appear at the beginning of blocks, due to their masks appearing before them. This effectively relaxes an over-eager verifier message. Fixes llvm/llvm-project#88297. Fixes llvm/llvm-project#88804.

This is essentially the same as #82404 but has the `nneg` flag which allows the backend to reliably undo the transform. Closes #88299

This vendor extension has the same shift_add as zba, and most of the same patterns are duplicated. Enable it here too so the configurations don't diverge.

This reverts commit c6e38b9. Causes miscompiles, see comments on #78304.

…educe.{fmin,fmax,fmaximum,fminimum}`; NFC

…ce.{fmin,fmax,fmaximum,fminimum}` Closes #88408

Fix a use of inline code markup to have a non-word character after the ending delimiter as required by reST.

libc.incude.fenv -> libc.include.fenv

…iles (#88994) followed the same style used in "TensorOps.td".

CONFLICT (content): Merge conflict in libclc/CMakeLists.txt

…e backend. (#88954) This is largely a revert of commit e817966. As #88029 shows, there exists hardware that only supports unaligned scalar. I'm leaving how this gets exposed to the clang interface to a future patch.

The previous patch was reverted because the test fails to build when libsanitizers is not present. This patch catches the BuildError exception and skips the test appropriately. This patch tests LLDB integration with libsanitizers for ASan. rdar://111856681

- With the memory operand as the destination, 'mr' variants of CFCMOV works like STORE and their memory operands should be input operands instead of output ones. Reviewers: XinWang10, arsenm Pull Request: llvm/llvm-project#88970

This change updates a few of the transformations in foldFMulReassoc to respect absent fast-math flags in cases where fmul and fdiv, fadd, or fsub instructions were being folded but the code was only checking for fast-math flags on the fmul instruction and was transferring flags to the folded instruction that were not present on the other original instructions. This fixes llvm/llvm-project#82857

…… (#88249) …se of tensor pack When the vector sizes are not passed as inputs to the vector transform operation, the vector sizes are queried from the static result shape in the case of tensor.pack op.

Since 97fe519, in ARM64EC mode, we don't define `__aarch64__`. Fix various preprocessor guards to account for this.

See https://discourse.llvm.org/t/rfc-fastmath-flags-support-in-complex-dialect/71981

This reverts commit 7d4e8c1. Contrary to the commit description, this does cause large compile-time regressions (up to 10% on individual files).

- Those special register stores are STORE and their memory operands are input operands instead of output ones. Reviewers: JDevlieghere, arsenm, yinying-lisa-li, koachan, PeimingLiu, jyknight, aartbik, matthias-springer Reviewed By: arsenm Pull Request: llvm/llvm-project#88971

- If a def operand includes multiple sub-operands, count them when generating instr info. - Found issues in x86 and sparc backends, where memory operands of store or store-like instructions are wrongly placed in the output list. Reviewers: jayfoad, arsenm, Pierre-vh Reviewed By: arsenm Pull Request: llvm/llvm-project#88972

jsji · 2024-04-22T13:16:11Z

@intel/dpcpp-cfe-reviewers @intel/dpcpp-spirv-reviewers @vmaksimo Can you have a look and add comments/approval explicitly so that this can be merged. Thanks.

vmaksimo

Approve for llvm-spirv fixes.

smanna12

CFE change looks good to me. Thanks

jsji · 2024-04-22T14:58:32Z

Thank you @vmaksimo and @smanna12 .

@bader @intel/llvm-gatekeepers This is ready for merge, arc failures are due to not available runners, common to other PRs. Thanks.

…needed to authenticate signed pointers (#86923) To authenticate pointers, CodeGen needs access to the key and discriminators that were used to sign the pointer. That information is sometimes known from the context, but not always, which is why `Address` needs to hold that information. This patch adds methods and data members to `Address`, which will be needed in subsequent patches to authenticate signed pointers, and uses the newly added methods throughout CodeGen. Although this patch isn't strictly NFC as it causes CodeGen to use different code paths in some cases (e.g., `mergeAddressesInConditionalExpr`), it doesn't cause any changes in functionality as it doesn't add any information needed for authentication. In addition to the changes mentioned above, this patch introduces class `RawAddress`, which contains a pointer that we know is unsigned, and adds several new functions for creating `Address` and `LValue` objects. This reapplies d9a685a, which was reverted because it broke ubsan bots. There seems to be a bug in coroutine code-gen, which is causing EmitTypeCheck to use the wrong alignment. For now, pass alignment zero to EmitTypeCheck so that it can compute the correct alignment based on the passed type (see function EmitCXXMemberOrOperatorMemberCallExpr).

Fixes for sycl

jsji · 2024-04-23T03:05:41Z

Incorporate #13358 into pulldown.
@premanandrao @ldrumm Can you please have a look to see whether I do it right to resolve the conflicts and keep the correct authorship. Thanks.

ldrumm · 2024-04-23T11:34:18Z

Incorporate #13358 into pulldown. @premanandrao @ldrumm Can you please have a look to see whether I do it right to resolve the conflicts and keep the correct authorship. Thanks.

f27e5de

Looks great. Thanks for making sure this is done right

jsji · 2024-04-23T13:25:34Z

@bader @intel/llvm-gatekeepers This is ready for merge now. Thanks.

jsji · 2024-04-24T00:25:20Z

@bader is on vacation. @intel/llvm-gatekeepers Can someone issue a /merge to get this merged. Thanks.

aelovikov-intel · 2024-04-24T03:15:59Z

/merge

bb-sycl · 2024-04-24T03:16:27Z

Wed 24 Apr 2024 03:16:27 AM UTC --- Start to merge the commit into sycl branch. It will take several minutes.

bb-sycl · 2024-04-24T03:20:43Z

Wed 24 Apr 2024 03:20:42 AM UTC --- Merge the branch in this PR to base automatically. Will close the PR later.

jsji · 2024-04-24T12:56:03Z

/merge

Thank you @aelovikov-intel

kazutakahirata and others added 30 commits April 16, 2024 12:47

[llvm] Drop unaligned from calls to readNext (NFC) (#88841)

f430e37

Now readNext defaults to unaligned accesses. This patch drops unaligned to improve readability.

update_test_checks: pre-commit a new test

191be2a

The test shows that name preservation doesn't work properly when --include-generated-funcs is used.

update_test_checks: add new test

e770249

This test is meant to demonstrate an upcoming change that replaces basic block labels by FileCheck patterns.

update_test_checks: remove an unused function

377a276

[flang][cuda] Add fir.deallocate operation (#88839)

9ec6c5d

Add the fir.cuda_deallocate operation that perform device deallocation of data hold by a descriptor. This will replace the call to AllocatableDeallocate from the runtime. This is a companion operation to the one added in #88586

Merge from 'main' to 'sycl-web' (3 commits)

6eb8559

CONFLICT (content): Merge conflict in clang/lib/CodeGen/CGExprCXX.cpp

[InstCombine] Add canonicalization of sitofp -> uitofp nneg

b6bd41d

This is essentially the same as #82404 but has the `nneg` flag which allows the backend to reliably undo the transform. Closes #88299

[RISCV] Enable mul strength reduction for XTheadBa

885b8d9

This vendor extension has the same shift_add as zba, and most of the same patterns are duplicated. Enable it here too so the configurations don't diverge.

Revert "Reapply "[LV] Improve AnyOf reduction codegen. (#78304)""

c6e0162

This reverts commit c6e38b9. Causes miscompiles, see comments on #78304.

[ValueTracking] Add tests for computeKnownFPClass of `llvm.vector.r…

266b2a2

…educe.{fmin,fmax,fmaximum,fminimum}`; NFC

[ValueTracking] Implement computeKnownFPClass for `llvm.vector.redu…

9eeae44

…ce.{fmin,fmax,fmaximum,fminimum}` Closes #88408

Clang Release Notes: Fix reST formatting

d19bd05

Fix a use of inline code markup to have a non-word character after the ending delimiter as required by reST.

[memprof] Use SizeIs (NFC) (#88984)

3074060

[libc][NFC] fix typo in fenv type proxy headers (#88982)

b1385db

libc.incude.fenv -> libc.include.fenv

[mlir][sparse][NFC] switching to using let argments/results in td f…

8aa061f

…iles (#88994) followed the same style used in "TensorOps.td".

Merge from 'main' to 'sycl-web' (9 commits)

2be7088

CONFLICT (content): Merge conflict in libclc/CMakeLists.txt

Merge from 'sycl' to 'sycl-web' (1 commits)

5005c0b

[X86] Fix instr desc of CFCMOV's 'mr' variants

50a3717

- With the memory operand as the destination, 'mr' variants of CFCMOV works like STORE and their memory operands should be input operands instead of output ones. Reviewers: XinWang10, arsenm Pull Request: llvm/llvm-project#88970

[bazel] Add support for lldb-server (#88989)

1bc0921

[mlir][vector] Determine vector sizes from the result shape in the ca…

ce5381e

…… (#88249) …se of tensor pack When the vector sizes are not passed as inputs to the vector transform operation, the vector sizes are queried from the static result shape in the case of tensor.pack op.

[ARM64EC] Fix arm_neon.h on ARM64EC. (#88572)

8c9f45e

Since 97fe519, in ARM64EC mode, we don't define `__aarch64__`. Fix various preprocessor guards to account for this.

[mlir][complex] Fastmath flag for complex angle (#88658)

8c9d814

See https://discourse.llvm.org/t/rfc-fastmath-flags-support-in-complex-dialect/71981

Revert "[SLP]Attempt to vectorize long stores, if short one failed."

efd6055

This reverts commit 7d4e8c1. Contrary to the commit description, this does cause large compile-time regressions (up to 10% on individual files).

jsji requested review from a team and bader as code owners April 19, 2024 21:30

jsji requested a review from MartinWehking April 19, 2024 21:30

jsji temporarily deployed to WindowsCILock April 19, 2024 22:07 — with GitHub Actions Inactive

jsji temporarily deployed to WindowsCILock April 20, 2024 00:02 — with GitHub Actions Inactive

vmaksimo approved these changes Apr 22, 2024

View reviewed changes

smanna12 approved these changes Apr 22, 2024

View reviewed changes

jsji self-assigned this Apr 22, 2024

ahatanak and others added 2 commits April 22, 2024 19:34

Reland #86923 with fixes

3ce9126

Fixes for sycl

jsji temporarily deployed to WindowsCILock April 23, 2024 03:18 — with GitHub Actions Inactive

jsji temporarily deployed to WindowsCILock April 23, 2024 04:08 — with GitHub Actions Inactive

ldrumm approved these changes Apr 23, 2024

View reviewed changes

bb-sycl approved these changes Apr 24, 2024

View reviewed changes

bb-sycl merged commit 425dbd7 into sycl Apr 24, 2024
14 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

LLVM and SPIRV-LLVM-Translator pulldown (WW16 2024) #13484

LLVM and SPIRV-LLVM-Translator pulldown (WW16 2024) #13484

iclsrc commented Apr 18, 2024 •

edited by jsji

Loading

jsji commented Apr 22, 2024

vmaksimo left a comment

smanna12 left a comment

jsji commented Apr 22, 2024 •

edited

Loading

jsji commented Apr 23, 2024

ldrumm commented Apr 23, 2024

jsji commented Apr 23, 2024

jsji commented Apr 24, 2024

aelovikov-intel commented Apr 24, 2024

bb-sycl commented Apr 24, 2024

bb-sycl commented Apr 24, 2024

jsji commented Apr 24, 2024

LLVM and SPIRV-LLVM-Translator pulldown (WW16 2024) #13484

LLVM and SPIRV-LLVM-Translator pulldown (WW16 2024) #13484

Conversation

iclsrc commented Apr 18, 2024 • edited by jsji Loading

jsji commented Apr 22, 2024

vmaksimo left a comment

Choose a reason for hiding this comment

smanna12 left a comment

Choose a reason for hiding this comment

jsji commented Apr 22, 2024 • edited Loading

jsji commented Apr 23, 2024

ldrumm commented Apr 23, 2024

jsji commented Apr 23, 2024

jsji commented Apr 24, 2024

aelovikov-intel commented Apr 24, 2024

bb-sycl commented Apr 24, 2024

bb-sycl commented Apr 24, 2024

jsji commented Apr 24, 2024

iclsrc commented Apr 18, 2024 •

edited by jsji

Loading

jsji commented Apr 22, 2024 •

edited

Loading